Massively Parallel Programming: A Hands-On Course: The Architectural Shift to General-Purpose GPUs

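The move from NVIDIA GT200 to the Fermi architecture marks the third generation of GPU computing. Earlier architectures were graphics-oriented units retrofitted for mathematical work, whereas Fermi was designed from the ground up with GPGPU (general-purpose GPU) applications in mind.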

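Unlike GT200, which emphasized texture units and rigid data parallelism, Fermi introduced a unified memory request path. This shift enabled computational thinking: developers could move beyond simple 2D grid mappings toward complex C++ algorithms.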
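
To make "beyond simple 2D grid mappings" concrete, here is a minimal host-side C++ sketch of a sparse matrix-vector product in CSR (compressed sparse row) form. It is an illustration only, not code from the course: the `CsrMatrix` struct and the `spmv` function are hypothetical names, and the loop runs on the CPU. On a GPU the outer loop would typically be spread across threads. What matters here is the access pattern: each read of `x` goes through an index array, so the addresses depend on the data rather than on a fixed (row, column) position.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical CSR container used only for this sketch.
struct CsrMatrix {
    std::size_t rows;                  // number of matrix rows
    std::vector<std::size_t> row_ptr;  // size rows + 1; row r spans [row_ptr[r], row_ptr[r + 1])
    std::vector<std::size_t> col_idx;  // column index of each stored value
    std::vector<double> values;        // nonzero values, stored row by row
};

// Computes y = A * x for a CSR matrix A.
std::vector<double> spmv(const CsrMatrix& a, const std::vector<double>& x) {
    std::vector<double> y(a.rows, 0.0);
    for (std::size_t r = 0; r < a.rows; ++r) {
        double acc = 0.0;
        for (std::size_t k = a.row_ptr[r]; k < a.row_ptr[r + 1]; ++k) {
            // Data-dependent gather: the element of x that is read
            // is chosen by col_idx, not by the loop counters alone.
            acc += a.values[k] * x[a.col_idx[k]];
        }
        y[r] = acc;
    }
    return y;
}
```

Rows can hold different numbers of nonzeros, so the work per row is uneven and the reads of `x` are scattered. That is the kind of irregularity a dense 2D grid mapping does not express directly.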

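Fermi introduced a true L1/L2 cache hierarchy and compliance with the IEEE 754-2008 floating-point standard. Researchers no longer had to manually manage scratchpad memory (shared memory) for every byte, which makes irregular data structures practical and provides the double-precision accuracy that scientific and engineering applications need.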

QUESTION 1

Which architecture is considered the true start of the 'Third Generation' of GPU computing?

GT200 (Tesla)

Fermi

G80

Fixed-function Pipeline

QUESTION 2

What memory feature was introduced in Fermi to help handle irregular data patterns?

Manual Scratchpad only

Hardware-managed L1/L2 Cache Hierarchy

Write-only Texture Buffers

Disabling Global Memory

QUESTION 3

Fermi's compliance with IEEE 754-2008 was critical for which application type?

Simple 2D Sprite Rendering

High-precision Scientific Computing (FP64)

Text Scrolling

Basic Vertex Shading

QUESTION 4

What does 'Computational Thinking' refer to in the context of the Fermi shift?

Treating the GPU as a fixed-function signal processor.

Focusing on the physics of the problem rather than manual data movement.

Manually coding assembly for every pixel.

Using only 2D textures for storage.

QUESTION 5

How did Fermi improve thread management?

It removed the concept of Warps.

It introduced sophisticated hardware thread scheduling.

It limited threads to only 32 per GPU.

It forced all threads to run the same instruction forever.